Source of Acquisition 
NASA Ames Research Center 



Introduction 


This tutorial is designed to provide information for both those new to Grids, and 
those who wish to incorporate Grid Services into their applications. 



m 


What we will cover: 

- Introduction 

• What are Grids? 

• Current State of IPG 

• Overview of Grid Middleware 

• Future Directions 

- Using the Grid 

• Prerequisites 

• Using Basic Components of the Grid 

• Accessing Grid Services from an application 

- Programming Models In Use 

• Parameter Studies 

• Multi-Disciplinary 

• Data-Mining 

- Data Management With the SRB 





Our Theme 


I’m a scientist who has 

- a design to study 

- a computer code to run a simulation 

- data about the design object 

- heard about the Grid and I’m curious 
Tell me more! 


Grids 


• Grids provide the infrastructure 

- to dynamically conglomerate independently managed: 

• Compute resources (generic and unique) 

• Data sources (static and live) 

• Scientific Instruments (Wind Tunnels, Microscopes, Simulators, etc.) 

- to build large scale collaborative problem solving environments that are: 

• cost effective 

• secure 

• Grid software is “middleware” 

• Grids are made up of a collection of independent 
resources (computers, storage, instruments, networks, 
etc) that can be tightly or loosely coupled, to solve a 
particular problem, as needed, via grid software 
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Why Grids? 


Characteristics Usually Found in 
w Grids ' 

For NASA and the general community today Grid 
middleware: 

- provides tools to access/use data sources (databases, 
instruments, ...) 

- provides tools to access computing (unique and generic) 

- Is an enabler of large scale collaboration 

• Dynamically responding to needs is a key selling point of a grid. 

• Independent resources can be joined as appropriate to solve a 
problem. 

- Provides tools for development of Frameworks 

• Provide tools to enable the building of a frameworks for applications. 

• Provide value added service to the NASA user base for utilizing 
resources on the grid in new and more efficient ways 


• An underlying security infrastructure such as the Grid Security 
Infrastructure (GSI), which is based on public key technology 

- Protection for at least authentication information as it flows from resource 
to resource 

• Readily accessible information about the resources on the Grid via a 
single mechanism, the Grid Information Service (GIS) 

• Single sign-on 

• A seamless processing environment 

• An infrastructure that is scalable to a large number of resources 

• The ability for the grid components to cross administrative boundaries 






Before the Grid 


User 


1 Application 1 


/l\ 

The User is responsible for 
resolving the complexities of 
the environment 

/ / 
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work 
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Site A 

" Site B 



independent sites 
independent 
hardware and 
software 

independent user ids 
security policy 
requiring local 
connection to the 
machine. 
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IPG Components 


Globus 2.0 

- Grid Information Services (GIS) 

• Monitoring and Directory Service (MDS 2.0) 

- Data Transfer: 

• Servers: GridFTP, GASS, gsissh 

• Clients: gsi-ncftp, gsi*scp 

Portal 

- Launchpad 

Certificate Authority 

- local to IPG 

Data Management 

- storage resource broker 

IPG Services 

- Resource Broker vl 

- Job Manager vl 


State of the IPG 


Globus Toolkit 


Server Nodes 

- 1024 node, single system 
image SGI at Ames 

- 512 node SGI 02K, Ames 

- 256 node SGI 02K, Ames 

- 1 28 node Linux Cluster, 
Glen Research Center 

- 64 node SGI 02K, Ames 


- 24 node SGI 02K, Glenn 
Research Center 

- 24 node SGI 02K, Ames 

- 1 6 node SGI 02K, Langley 

- 8 node SGI 02K, Ames 

- 4 node SGI 02K, Langley 


Client Nodes 

— 1 6 node SGI 0300, JPL 

- 4 node SGI 0300, Goddard 

Wide area network interconnects of at least 100 Mbit/s 
Storage resources: 50-100 Terabytes of archival 
information/data storage uniformly and securely 
accessible from all IPG systems 


Grid computing middleware 

- Software between the hardware and high-level services 

- Basic libraries and services 

Most common middleware used in grids 
Primary implementation in C 

- All libraries, command-line programs, client and server 
Secondary implementation in Java 

- JavaCoG 

- Many of the client libraries 
Current version is 2.2.3 

- 2.0 installed on many IPG systems 

- 2.2.3 installed on a few 
http://www.globus.org 









Globus Software Architecture 


Monitoring and 
Discovery Service 


obus Deployment 




•login 

•execute commands 
•copy files 


•get and put files 
•3rd party copy 
■interactive file management 
•parallel transfers 

/ •execute remote applications 
•stage executable, stdin, 
stdout, stderr 


Grid Security Infrastructure (GSI) 


Globus Resource Allocation 
Manager (GRAM) 

I | PBS II LSF II fork/exec I 


credentials for users, 
services, hosts 


•authentication 
•secure communication 


\ job management 
systems 

•single sign on 
•delegation of credentials 
•authorization 


1 Current IPG Services 



\ ASA ] 

Current IPG Services j 



ORAM Job X party cc 


Job Manager 

- Reliably execute a job 

• Set of files to pre-stage 

• Executable to run 

- Including directory, environment variables 

• Set of files to post-stage 

Resource Broker 

- Provide suggestions on where to run a job 

- Input 

• Which hosts and OSs are acceptable 

• How to create a Job Manager Job for a selected host 

- Selection made using host and OS constraints and host load 

• Interactive system: # free CPUs 

• Batch system: Amount of work in queue / # CPUs 

- Output 

• Ordered list of Job Manager Jobs (suggested systems) 








Ongoing Work 


Open Grid Services Architecture 
Globus Toolkit version 3 
IPG Services 
Data Grid 


New framework for creating grid services 
Based on web services 

- Standards to build and use distributed services 

• Service description language: WSDL 

• Service invocation: SOAP 

• Service location: UDOI (not used in OGSA) 

OGSA extends web services with: 

- Requirements for service interfaces such as providing service data and 
notifications 

- Service management (creation, destruction, lifetimes) 

- Security 

Implemented atop Apache Axis 
Standardizing in the Grid Forum 

- Architecture: http://www.ggf.org/ogsa-wg 

- Implementation: http7/www.gridforum.org/5_ARCH/OGSI.htm 


Globus Toolkit Version 3 (GT3) 


Large change from GT2 to GT3 

- Entirely new implementation 

- Java-based instead of C-based 

- GT3 based on OGSA 

GT3 will provide equivalent services to GT2 
Alpha version of GT3 currently available 
IPG won’t transition to GT3 soon (guess a year) 

- Need version 1 and time to evaluate 

Transition should have minimal impact on our users 

- Globus will maintain many of the existing programs 

- IPG Services will switch to GT3 in a transparent manner 

http://www.globus.org/ogsa 


IPG Services 


Goal: Location-independent computing 
Intelligent resource selection 

- User-spedfied preferences and requirements 

- Resource characteristics, access, allocations, cost, load, performance predictions 
Create system-dependent job from system-independent job 

- Given a system (or systems) to use 

- Pick directories, set paths and environment, specify which executables and 
libraries to move, specify which data files to move 

Reliable execution of system-dependent job 

- Application execution & file management 
Dynamic access and accounting 

- Run applications without a permanent local account 

- Charging resource use to remote accounting systems 

- Resource pricing 
Workflow management 

- Specify and execute workflows 
Implemented as OGSA services 
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Prerequisites 


What I need to make use of IPG Services 

• Get an account 

• Create an IPG Certificate 

• Learn how to use and access Grid Services 
- stay tuned for the rest of this tutorial 



Getting an Account 



• Easiest way is to use the on-line account request 
form at: 


http: / / www . ipg . nasi . gov/ipgua*rs/gettingstart«d/*ccounta . html 

• Ames, Glenn, and Langley accept approved 
requests from this form. 

• JPL and Goddard are just joining as client sites 
and have separate forms at this time. 


Group ID )BT 

• The account request form requires a Group ID or 
GID 

- Obtained from a Principle Investigator (PI) 

- Approves your account request 

- Provides a group ID (GID) 

• To become a PI 

- Write a proposal for a Grid Related Project 

- Contact the NASA Center you are most likely to be 
affiliated with 



I’ve Been Approved! 


I received information about my accounts 
I can log in 

Select a Grid system to be your “Home” system 
Terminology: “Grid System”, or Grid-enabled 
system is one that is running Grid middleware. 
- At a minimum my “home” system should have: 

• The Globus client commands 

• The Grid Security Infrastructure (GSI) libraries 

• Allows interactive login 

• Will hold my credentials 
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Getting My Certificate Part 2 l|y 


Display Certificate Information lE 

• I’ve got mail! Instructing me to execute the command 
ipg-cert-retrieve 

- Included in the mail is the exact format of the command 1 should 


• Use the grid-cert-info command to display 
your certificate information 
- For example: 

% grid-cert-info -all 

Issuer: O-Grid, 0=National Aeronautics and 
Space Administration, OU=Ames Research Center, 
CN=Certificate Manager 
Validity: 

Not Before: Nov 20 20:30:18 2002 GMT 
Not After : Nov 20 20:30:18 2003 GMT 
Subject: O-Grid, 0=National Aeronautics and 
Space Administration, OU=Ames Research Center, 
CN=George B . Myers 

execute to get my signed certificate 

- Example: 

% ipg-cert-retrieve -id 0x14b 

• A directory named . globus is created 

- In it you will see two files: 

• user cert . pem - is your public key 

• user key . pem - is your private key 

• Grid Administrators are notified to add you to a file that maps your 
subject name to your user ID on all systems you have access to 

• More information at: 


http : //www . ipg . nasa . gov/ipguser s/gettings t*rted/cert_top . html 



^ «■ 





What’s Next? 


OK, 

- Got my accounts, 

- got my certificate, 

- I’m ready to go! 

Not so fast... 

- we want to introduce the concept of delegation. 

- The idea is to give processes on the remote system to 
act on your behalf. 

- In Grid Security Infrastructure this is done with what is 
called a proxy. 
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While We’re on Proxies 


MyProxy ^ 

• Can 1 access the Grid from a Web Portal 

- The answer is yes, however a Portal needs a way to 
represent you 

- The MyProxy Service provides this capability 


• A Client/Server method of storing a proxy for later 
retrieval from things like Web Portals 

• Using the secure socket technology of the web a proxy is 
stored on a secured server 



• The proxy has a limited life time and a different pass- 
phrase than that of your certificate. 



• The proxy is retrieved by the Portal on your behalf and 
used in the same fashion as on any other Grid system. 





The myproxy-init command 


• The myproxy-init command is executed on a Grid system where 
your certificate resides. Using SSL, and parameters and a 
passphrase you provide your proxy is stored on the MyProxy Server. 

• Example: 

% myproxy-init 

Your identity: /O-Grid/O-Nationxl Aeronautic* and Space 

Adainis tration/OO-Aroe* Research Canter/OKeorge B. Myers 
Enter GRID pass phrase for this identity: 

Creating proxy 

Done 

Your proxy is valid until Thu Jan 23 1C:40:3C 2003 
Enter MyProxy Pass Phrase: 

Verifying password - Enter MyProxy Pass Phrase: 

A proxy valid for 1(1 hours { 7.0 days ) for user gayer s now exists on 

ayproxy ipg . nasa . gov . 

% 

• Also a myproxy-destroy command to remove your proxy from 
the Myproxy Server 








What’s Next? 


• Ok, 

- Got my accounts 

- got my certificate 

- got my proxy 

- I understand delegation 

- and I have authenticated myself on a resource or two 

• What’s next? 

- Now let’s try to log into a resource and see what single 
sign-on is all about 







Logging Into a Resource 



• Interactive access to a resource may be limited. Check 
with local policy to be sure. 

• For those resources that allow interactive access: 


% gsissh hosta.nasa.gov 
... [motd, etc.] 

[51] [sharp] > 

This also would verify ability to authenticate to the 
system. 

• And it doesn’t matter if my ID on the remote 
system is different. 


Don’t Forget to grid-proxy-init 


Another Example 



% globus run -s -r hosts.nass.gov 
graa_init failure: 

Credentials Expired 

proxy expired: run grid-proxy-init or wgpi first 
File*/ tap/x509up_u4772 
Function :proxy_init_cred 

CSS status: ssjor : OOObOOOO ainor: 0000041a token: 00000000 
Error initialising GRAM: authentication with the reaote server failed 
% grid-proxy-init 

Your identity: /(^Grid/0*Nation«l Aeronautics and Space 

Adeinistrstion/OU-Ames Research Center/CM-Ceorge B. Myers 
Enter GRID pass phrase for this identity: 

Creating proxy 

Done 

Your proxy is valid until Sat Jan 23 01:43:19 2003 
% globus run -a -r hosta.naaa.gov 


Here the only due is you are prompted for a password: 

hostb% gsissh hosta.nasa.gov 

gmyars0hosta.nasa.gov' s password: #[raverts to ssh] 
hostb% grid-proxy-init 

Your idantity: /0=Grid/0=National Aaronautics and Spaca 
Adminis tration/OU=Araes Research Ca 
nter/CN=George B. Myers 

Enter GRID pass phrase for this identity: 

Creating proxy Dona 

Your proxy is valid until Mon Feb 3 00:51:57 2003 


GRAM Authentication test successful 


hostb% gsissh hosta.nasa.gov 

Last login: Wed Jan 29 14:26:46 2003 from hostb.nasa.gov 
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What is a Job? 



• A job is: 

- Work to be done 

- That has Environmental Attributes 

- That has Resource Requirements 

• An Example: 

- An executable to analyze data 

- That requires INS3D and runs in /scratch 

- That needs 1 28 CPUs, 2 Gigabytes Memory and 
needs 2 wall-clock hours to run 

• Another Example: 

- A simple command such as Is or a script with a series 
of commands 


BBS 7- 


Traditional Model 
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Globus Mode! 



. My Home system on the Grid 


% grid-proxy- ini t 

% gr id_f i l«_copy_c oramand f il«a_on_mxaator to_resourc«_a 
% grid_fil«_copy_comraand filab_on_masator to_r«aourc«_b 
% .execut*_grid_job on_risourc«_» 

% | execute grid_ job onreaourca b 


Globus 


Middleware 


resource_a 


resource_b 


Masstorage 


GRAM 



Globus Resource Allocation Manager 
(GRAM) is the Globus middleware 
component that handles the submission 
of jobs 

Client/Server Model 

Client-side r - 

- commands 

- API’s in C and Java (COGkit) 
Server-side 

- Gatekeeper for Authentication 

- Job Managers to handle job execution 
Uses the Resource Specification 
Language (RSL) to identify job 
requirements 


•execute remote applications 
•stage executable, stdin, 
stdout, stderr 



Globus Resource Allocation 
Manager (GRAM) 


| PBS || LSF || fork/exec J 



job management 
systems 






GRAM Server-Side 


GRAM Gatekeeper 

- uses the GSI libraries 

- Authenticates the user proxy 

- Delegates control on the user’s behalf via a job-manager 

GRAM Job-Manager 

- At least one will be present 

- The “fork" job-manager had the job directly to the system 

- Other job-managers hand the job to a Job Management System to be 
scheduled according to local policy 




GRAM Client-Side 


Client-side commands provide basic user interface 

- globus min is the primary GRAM command-line interface 
provided by Globus 

GRAM API’s provide ability to access GRAM from 
programs 

- C language API available 

- Java language API provided in the Globus COGkit 


globusrun Command )£ 


Simple Example jK 

• Two ways to execute the command: 


Simple command with RSL on the command-line and 

- Wait for job to execute and output to come back to terminal 


wait for output to come back: 

- Don’t wait, and get a Globus Job ID handle 



• Two ways to specify Resource Specification Language 


% globusrun -r hostc.nsss.gov "&(sxscutsblo*/usr/bsd/hostnaiM) " 

(RSL) 


% 

- On the command-line 



• Ok for one or two parameters 


Let’s not wait for the output: 

• Tedious for large numbers of parameters 


% globusrun -b -r hostb.nsss.gov (sxscutsbl«=/usr/bsd/hoatnsms) * 

• However, simpler for parameter substitution 


GRAM Job subaiaaion successful 

httpa : / /hoatb.naaa. gov: 10298/52663465/1043801266/ 

- In a file 


% globua- job-atstua httpa: //hoatb. naaa.gov: 10298/52663465/1043801266/ 

• Better with large number of parameters 


PENDING 

« 

• Saves retyping parameters 





— 








A Side Note 


Be aware that when executing the globusrun 
command interactively (I.E. waiting for output to 
return to the terminal) that output may not return 
immediately 

- IF the compute resource has no fork job manager 
You might be waiting a long time if there are a lot 
of jobs ahead of you in the queue. 




GRAM Model 


Let’s follow a job through the GRAM interface 





Another Example 


For a slightly more complex example with RSL in a file: 


% cat Badiua.nl 
& (count- - 1”) 

( jobtypa-" singla" ) 

(axacutabla— " /lc/gnyar a/ script") 
(maxHallTiaa— 30 ) 
(atdout-/lc/gmyars/BBdiuB. out) 
(atdarr-/lc/gnyars/madiua. arr) 


I cat script 
I madiun»_»cript 
/bin/data 

/bin/acho Ran on ' /uar/bsd/hostnama ‘ 
/bin/acho * ” 

/us r/ local /pbs /bin/ qatat -au gnyars 
/bin/data 


Had Jan IS 1S:03:36 PST 2003 
Running on hostb 

Had Jan IS 13:03:36 2003 

Sax-rar raports 1 job total (R:l Q:0 B:0 H:0 T:0 1:0 

avalyn : 1/C nodaa uaad, 10 CPU/245Qab fras, load 2.30 (R:l T:0 1:0) 

Rag'd Raq'd Klap 

Job ID Oaarnaaa Quaua Jobnaaa SaasID TSK Msmory Nda wallt S wallt 

4600. hostb gayars hostb STD IN 3160473 2 4 90mb 1 00:05 R 00:00 

Had Jan IS 13:03:36 PST 2003 


About RSL 


count - identifies number of processes desired (CPUs) 

jobtype - identifies how to initiate the job 

j ob type* "mul tiple " - starts count processes (1 / CPU)(default!) 
jobtype*" single" - start a single process to which count CPUs 
are applied 

jobtype="mpi" - starts an MPI job that uses count CPUs 

maxWallTime*" seconds" - set the job time request to wall time-good 
for dedicated resource allocation 

maxCPUTime*" seconds" - set the job time request to CPU time-good 
for shared resource allocation 

maxMemory="bytes" set memory constraints 

executable*" file__name" - path to the executable script or command 
you want to execute itot 
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‘Comments on RSL 


An executable can be anything from a command, application or 
script 

J ob t ype=mul tiple might be used where you have your own inter- 
process communication 

j ob type=single would be used when using OpenMP, and other 
multithreading technologies 

Be aware that the use of maxMemory is not consistent through the 
Grid. Some sites consider it to mean memory per job, while others 
consider it to mean per process. 




What’s Next S" 

• Ok, 

- I now know how to submit a job to a grid resource 

- I know more than I ever wanted about GRAM 

- I know what to expect when I execute globusrun 

- I know about RSL and how to use it 

• Now What? 

- How do I move my data around in the environment? 

• Its simple, 

- There exists modified FTP and SCP commands 

- Plus a client that allows 3rd party transfer 


c/ 




File Transfer Methods 


Grid-enabled File transfer protocol API - GridFTP 
gsincftp - NCFTP with GSI that uses GridFTP 
globus-url-copy - enabling third party transfers 
gsiscp - GSI enabled SSH copy command 




Data Movement 


Client 

. globus_url_copy , 
y gsincftp 


Server 

►http server 


gsiscp ■ 


7—°i 

. gridFTP server 
►gsissh server 


19 








E 







What’s Next 


• OK, 

- I understand how the basics work 

- I know how to submit a job 

- I know how to move my data 

• What’s Next? 

- Putting it all together. 

• Creating scripts to allow me to stage my actual input files, run 
my application, and get my results. 



INS3D 



has been used in numerous 
applications, 

- analysis of the fuel turbopump for the 
Space Shuttle's main engines. 

- to make major design improvements to 
a heart device that could save 
thousands of lives each year. tDeBakey 
Ventricular Assist Device 

INS3D code solves the incompressible 
Navier-Stokes equations in three- 
dimensional generalized coordinates 
for both steady-state and time varying 
flow. 

It is written in Fortran 77, with some 
extensions and several routines in C. 
The source code comes ready to 
compile and run on either a Cray 
computer or a Silicon Graphics 
workstation. 


l 



21 






‘Example 1 


#! /bin/csh 

sat v«rbos« 

• Stag* file* from horn* machine to compute node 

gsincftpput hoste.nese.gov . /di r I PGRunDemo \ 

. /dirINS3D/dirI PGDemo/ ins3d. in \ 

. /dir INS3D/ dir I PGDemo/ xy* . fat \ 

. / dir INS3D/dir I PGDemo /bcraain . det 

• Run my job 

globus run -s -f ins3d.rsl 

• Get output date from compute nod* put beck on home machine 

gsincftpget hoste.nese.gov \ 

. /dir INS 3D /dir I PGDemo/ dirOutput \ 

. /dirlPGRunDemo/ ins3d. out . /dirIPGRunDemo/fort.20 \ 

. /dirZPGRunOemo/fort . 30 . /dirlPGRunDemo/ fort . 31 \ 

. /dirlPGRunDemo /fort. 33 ./ dirlPGRunDemo /cf. det \ 

. /dirlPGRunDemo/ cp .det 


What’s Next? 


- I have learned how the grid works 

- Submit a job 

- Move data 

- Put it all together in a script for my application 

What else is there? 

- How do I find resource I don’t already know about? 

• You use the Grid Infromation Service (GIS) 


‘Example 2 


• ! /bin/csh 


$MSS_tJRL» gsiftp : //lou . nss . nasa . gov/u2/s3/klotx/dirIMS3D/dirDeeo 
$WODE_URL- gsiftp : //sharp . as . nren . nasa . gov/u/klotx 

• Pre-stage files fro* mass storage to compute node (3rd party) 
globus-url-copy $MSS_0IU./ins3d.in $MOOK_URI./dirIPGRunD«mo/ins3d . in 
globus-url-copy »MSS_URL/bcmain.dat $NODE_URI. /dirlPGRunDemo/bcmain .«iat 
globus-url-copy $MSS_URL/xyx.fmt $MODE_URL/dirIPGRunDemo/xy* . fmt 


:- stage files fro* compute node to mass storage (3rd party) 
i-url-copy $NODE ORL/dirIPGRunD«mo/ins3d. out )K33_URI. /ins3d.c 
i-url-copy JMODE_URI./dirIPGRunD«mo/fort. 20 $KSS_URL /fort. 20 
i-url-copy $MODE_ORI./dirIPCRunDs«o/fort. 30 $MSS_URL /fort. 30 
l-url-copy $NODE URL/dir IPGRunOemo/ fort . 31 5MSS_URI. /fort. 31 
i-url-copy $NODE_URL/dirIPGRunDemo/fort. 33 JK3S_UR1. /fort. 33 
l-url-copy $M 0 DE_t?RI./ dirlPGRu nD em o / cf . dat $MSS_ORL /cf.dat 
l-url-copy $H 0 DE_URL/ dirlPGRunDemo/ cp .dat $HSS_URL /cp.dat 


What is the GIS? 


distributed directory service 






How Do 1 Query the GIS? 


grid-info-search 

• There are a number of tools to help you query 
GIS to find resources and information about 
them: 


• Fairly complex syntax 

• To use it well requires some knowledge of LDAP and the 
data in the database 

- grid-info-search command, 

- LDAP browser 

- IPG Resource Broker Service. 

- Launch Pad 


• Example search: 

% grid- info- saarch -x -T.T.T. -h hostb.nasx.gov -b "Mds-Vo- 
nwi>s»ipg,o»nasa,o»grld" ” (Ob j«ctclass=MdsHost) " dn 

da : Mda-Boat-hn-hoa tb . aasa ■ gvr ,Hda-Vo-n»— -jpg , o—naaa , o~gr id 

dn : Mds-Boat-hn-hoa ta . naaa . gov , Mda- Vo-naaa-ipg , o-naaa , o«gr id 

— — 


» 





LDAP Browser 


• An LDAP browser provides a web based interface 
to search the Grid information database. 

• Still requires knowledge of the structure of the 
data-the object-classes and attributes. 

• Demo of LDAP Browser 

• Browse the data maintained by LDAP server 

• Use to browse Globus MDS 

• Select a directory server, connect and open a 
session 

• A nice LDAP Browser can be found at: 


http://www.mcs.anl.gov/~gawor/ldap 
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LDAP Browser - MDS Data 



1 LDAP Browser - Search Filter (contd.) 

► 


r- ^ 

\ASA 

/ t 

Launch Pad Resource Page ! 



The IPG Web Portal, Launch Pad, has a resource 
page that shows the various resources available 
on the Grid 

Demo Launch Pad Resource Page 

Launch Pad is accessible from the IPG Home 
page, or at: 

https : //portal . ipg . nasa . gov/launchpad/ sarvlat/launchpad 






*What’s Next 


• Ok, 

- Now I can use all the basic grid tools to 

• Learn about resources 

• Write scripts to package the grid tools for my application 

» Is there anything that makes this a bit easier? 

- Yes, 

• IPG has developed higher level services that help you with 
some of the items you have already learned 

- IPG Job Manager 

- IPG Broker 
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Job Manager API Summary ^§T 


Constructors 

JobManager ( ) 

JobManagar Job ( ) 

Specify executable parameters 

JobManagar Job . sa tHost ( ) 

JobManagar Job . ae tExecutable ( ) 

Specify pre-stage pairs 

JobManagar Job . addPreS tagePair ( ) 

Specify post-stage pairs 

JobManagarJob . addPoatS tagePair ( ) 

Specify state change listener 

JobManagarJob . addLis tenar ( this ) 

Submit job 

JobManagarJob . submit (JobManagar) ; 



SQp Job Manager Submit Job pt 


JobManager jobManager = new JobManagerQ; 


JobManagerJob job = createJob(); 
job.submit(jobManager); 


Creating JobManager Job - 1 


Creating JobManagerJob -2 



btagerair pair! = new 

JobManagerJob createJob() { 


StagePair(StagePair.PROTOCOL_GRIDFTP, 

JobManagerJob job = new JobManagerJob(); 


"mss.nasa.gov” 

job.setHostfhosta.nasa.gov") ; 


“Vmss/dirlNS3D/dirDemo", 

job.setDirectoryfJdirlPGRunDemo"); 


"ins3d.in", 

job.setExecutable("./ins3d dp"); 


”hosta.nasa.gov H , 

job.setNumCpus(l); 


"VdirlPGRunDemo"); 

job.setWallTimeMinutes(5); 


// files to be pre-staged 

job.setStdinLocation("ins3d.in"); 


job.addPreStagePair(pair1 ); 

job.setStdoutLocation("ins3d.out"); 



job.setStderrLocation( ,, jns3d.err ,, ); 


// files to be post-staged 



job.addPostStagePair( ); 

*4 
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Resource Broker API Summary 


Lining the Broker jB 

Setup 


Broker broker = new BrokerClient(); 

BrokarClimnt () 

Simple Job ■ BrokerClient . create Job () 


SimpleJob job = broker.createJob(); 

Specify job requirements/attributes 


Simple Job . setNuroCPUs (int numCPUs) 

Simple Job. setExecutebleForOS (String exec. String oa) 
Simple Job. setOirectoryForHost (String dir, String host) 


// configure job - see next slide 

- 


JobManagerJobQ suggestions = job.suggestSystems(); 

Suggest systems 



JobManager Job [ ] = 

Simple Job. suggestSystems (int maxSystems) 


JobManager jobManager = new JobManager(); 



suggestions[0].submit(jobManager); 





Configure Broker Job 


Using the Broker 

job.setNumCPUs(l); 


Broker broker = new BrokerClient(); 

job.setWallTimeMinutes(5) ; 

job.setDirectoryForHost("./dirlPGRunDemo", "hosta.nasa.gov"); 
job.setDirectoryForHostf./dirlPGRunDemo", "hostb.nasa.gov"); 
job.setStdinLocatlon("ins3d.in"); 
job.setStdoutLocation("ins3d.out"); 
job.setStderrLocation( M ins3d.err"); 


SimpleJob job = broker. create Job(); 
// configure job - see next slide 

/identify files to be pre-staged 


JobManagerJobQ suggestions = job.suggestSystems(); 

StagePair pairl = new StagePair(...); 
job.addPreStagePairForHostfpairl, "hosta.nasa.gov"); 


JobManager jobManager = new JobManager(); 

//identify files to be post-staged 
StagePair pair4 = new StagePair(...); 
job.addPostStagePairForHost(pair4, "hosta.nasa.gov"); 


suggestions[0].submit(jobManager); 

— 








6^ IPG JobManager, Broker 




What’s Next? Iff 


• Ok, 

- Now I think I am pretty knowledgeable about the Grid 
environment, but I have a question? 

- Earlier, you talked about MyProxy, but we have not 
used it yet for my application. 

• What gives? 

- Well, as we mentioned, MyProxy is for web portals to 
have a way to access your proxy to execute 
commands on your behalf. 

- We will now show you a portal IPG has build and how 
you could use it for you application if you preferred a 
web interface. 



& IPG Web Portal |ff 

• Launch Pad is the IPG Web Portal 

• You can do everything we have demonstrated 
and more in Launch Pad 

• Security is provided using a proxy the same way 
it is from any system on the grid. 

• The proxy is retrieve from a server called 
MyProxy that stores proxies for this specific use. 
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Launch Pad Step 2: Select System 


Launch Pad Step 3: Submit Job 


Dk*ctary.AiAser/sc demo/ 
name - OMP_NUM_THREADS 


Number of CPU 1 »;4 
Wal lame: 15 
Memory Aller ated; 

Okie here to nice: 


Source Horcielecred hoa South Dirtcterjr Run 
Deitmation Kiituo(iui u>a|or D.rUnaUon D 
fiker.MMOOT.- 


Outline 


Introduction 

• What are Grids? 

• Current State of IPG 

• Overview of Grid Middleware 

• Future Directions 
Using the Grid 

• Prerequisites 

• Using Basic Components of the Grid 

• Accessing Grid Services from an application 
Programming Models In Use 

• Parameter Studies 

• Multi-Disciplinary 

• Data-Mining 

Data Management With the SRB 
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Parameter Studies 


Evaluate a system under different conditions 
Examples: 

- Aerospace vehicle at different speeds and angles of attack 

- Climate change with different rates of greenhouse gas emissions 
Run a simulation multiple times with different input parameters 
Problems 

- Specifying the input parameters for each simulation 

- Executing the simulations 
Approaches 

- Roll your own scripts 

- Use general tools 

- Use application-specific tools 

Natural application for distributed systems 

- Independent tasks with no intercommunication 

- Grids enable 
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Scripted Parameter Study 


Write a script to run your parameter study 

- Complexity increases as you run more and more 

• Command line parameters, better error checking, better 
resource selection, ... 

- IPG services can help with this a bit 

Example of a perl script that uses globusrun to 
execute several INS3D runs on different IPG 
systems 




Parameter Study Tools 


These tools do much of the work for you 

- Selecting where to execute 

- Error detection and recovery 

- Easy specification of parameter studies 

ILab 

- General parameter study tool 

AeroDB 

- Parameter study tool for running CFD applications 

Tool Agent Framework in Java (TAF-J) 

- Perform various aerospace analysis tasks, including 
parameter studies 


i0" 


- ILab - 

Production-Level Distributed Parametric 
Study Capabilities for the Grid 

M. Yarrow 
K. McCann 
A. DeVivo 
C. Fedalizo 

NASA Ames Research Center 


http://www.nas.nasa.gov/ILab 

yarrow@nas.nasa.gov 

mccann@nas.nasa.gov 
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I Lab Features 


I Lab : The Information Power Grid Virtual Laboratory 


• Automated high throughput on multiple systems 

• Operates with Globus (or without - using scp/ssh) 

• Transparent access to parallel schedulers and libraries 

• Monitoring, logging, and archiving 

• Ease of use: sophisticated GUI with built-in HELP; arbitrary 
parameterization with custom parameterizing editor 

• API for programming access (without GUI) 

• Restart capability for CFD usage 

• Handles complex multistage processes 

• Reduces design cycle time - by a factor of up to 1 00! 

• Simple and rapid out-of-box experience 

• Proven track record of success and user satisfaction: 

- Users at Boeing (CFD applications, Nastran) 

- Used at UC Davis for CFD studies 

- NASA Ames users (Semiconductor Device Modeling, 
Rotorcraft blade modeling) 

- Industry (Circuit Simulation) 



Parameterization made simple and 
easy 

I Lab is really a code generator 
generates and submits scripts to 
execute parameterized jobs 
Absolutely no programming or 
scripting required 
Several Job Models, including 
Globus, PBS, LSF, Condor. MPI, 
local or remote, "ganged' 
Real-time monitoring of job status, 
job stopping, masking 
Organization and archiving of all 
"Experiment" data 
MRU and complete "history" 
secretanal Features 
Built with Object-Oriented 
programming model 
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AeroDB Experiments 



• Evaluate Liquid Glide Back Booster 

• Performed a week of runs in September 

• 5964 jobs 

• Approximately 57065 hours of CPU time 

• 1 0 computer systems 

- SGI Origins, Sun SMP, Linux cluster 

• 4 centers 

- Most of the work performed on large Ames Origins 

t» M.4.KU 


Tool Agent Framework - Java 
(TAF-J) 



Tool Agent Framework-Java 



• A Java framework was for running aerospace analyses on 
the IPG. 

• Analyses supported: 

- Geometry: modify control surface deflection 

- CFD: run CFD analysis for parameter space 

- Trim: compute trimmed lift and drag from existing data using 
regression 

- Trajectory: run trajectory tool 

- Structural: run CFD analysis, then run structural analysis using 
CFD loads 

- AbortGC: do complete abort trajectory analysis 

• Tools available: 

- Pro/Engineer, VORVIEW, CBAERO, FLOWCART, AIRPLANE, 
OVERFLOW-D, GASP, ANSYS 

• Problem: Abort analysis of reusable launch vehicle 


TAF-J Example 
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FLOWCART solution 
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Grid Miner 


Developed as one of the early applications on the 
IPG 

- Helped debug the IPG 

- Provided basis for satisfying one of two major IPG 
milestones last year 

Provides basis for what could be an on-going 
Grid Mining Service 





Starting Point fnr Grid Miner 


Grid Miner reused code from object-oriented ADaM data mining 
system 

- Developed under NASA grant at the University of Alabama in 
Huntsville 


- Implemented in C++ as stand-alone, objected -oriented mining 
system 

• Runs on NT, IRIX, Linux 

- Has been used to support research personnel at the Global 
Hydrology and Climate Center and a few other sites. 

• Object-oriented nature of ADaM provided excellent base for 
enhancements to transform ADaM into Grid Miner 


Transforming Stand-Alone Data 
M iner i n to G rid M iner 

• Original stand-alone miner had 459 C++ classes. 

• Had to make small modifications to 5 classes and 
added 3 new classes 

• Grid commands added for 

- Staging miner agent to remote sites 

- Moving data to mining processor 
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NASA 


How to define Metadata 

Commands to move data in and out of jobs 

API to incorporate calls within our sample 

programs 


What we have talked about today 
How do I get Help 

- Describe IPG Support Model 

- IPG Web Pages, references 


IPG Launch Pad 


Launch Pad Interface 


Launch Pad 

- A general web portal to access 
IPG resources and services 
• Start/monitor/stop 
applications, move ffles. 
information about resources. 


- In the future, launch pad code 
can be used as a base to build 
application-specific portals 

‘MyGRID” 

- Project to maintain user 
specific information that is 
accessible throughout the grid 
either via a portal such as 
LaunchPad or via the 
command line 


f 

How to Use SRB on IPG 

(■r 


V ' \ 

NASA 

i2i 

/ 

Summary 








